[Bugfix] Fix triton import with local TritonPlaceholder #17446

MengqingCao · 2025-04-30T05:41:02Z

Fix triton import error in non-triton platforms with the local TritonPlaceholder

github-actions · 2025-04-30T05:41:14Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

mergify · 2025-05-02T07:45:42Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @MengqingCao.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: Mengqing Cao <cmq0113@163.com>

MengqingCao · 2025-05-02T12:30:17Z

Plz help to review this pr, thanks! @Isotr0py @zou3519 @houseroad

### What this PR does / why we need it? Re-patch TritonPlaceholder on main to make CI happy - Add triton patch back until vllm-project/vllm#17446 resolved - Move patch_main before patch_common to resolve minicpm triton import issue - Add `0.8.5` and `0.8.5.post1` to make patch work on 0.8.5 all versions Related: - #704 - #690 ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? All CI passed include main Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

houseroad

The changes make sense to me. Left one inline comment.

houseroad · 2025-05-06T00:43:19Z

vllm/triton_utils/__init__.py

@@ -1,5 +1,13 @@
 # SPDX-License-Identifier: Apache-2.0

-from vllm.triton_utils.importing import HAS_TRITON
+from vllm.triton_utils.importing import (HAS_TRITON, TritonLanguagePlaceholder,


can we add some unittest for this placeholder logic?

Sure, I'll add it soon

Signed-off-by: Mengqing Cao <cmq0113@163.com>

MengqingCao · 2025-05-06T08:43:12Z

@youkaichao @Isotr0py Sorry for bothering you, CI failed due to unrelated uts (tested locally without this pr, and same timeout error raised)

could we fix them in next pr and merge this first?

mgoin · 2025-05-06T09:55:46Z

Could we add a pre-commit or GHA check that there are no new import triton being added in PRs?

### What this PR does / why we need it? - Revert "Re-patch TritonPlaceholder on main to make CI happy (#753)" because upstream main CI already merged: vllm-project/vllm#17446 - Keep 0.8.5.post1 compatible ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

MengqingCao · 2025-05-06T11:07:08Z

Could we add a pre-commit or GHA check that there are no new import triton being added in PRs?

Thanks for this good catch, I'll add it soon.

* [Model] Add GraniteMoeHybrid 4.0 model (vllm-project#17497) Signed-off-by: Thomas Ortner <boh@zurich.ibm.com> Signed-off-by: Stanislaw Wozniak <stw@zurich.ibm.com> Co-authored-by: Thomas Ortner <boh@zurich.ibm.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by: Tyler Michael Smith <tysmith@redhat.com> * [easy] Fix logspam on PiecewiseBackend errors (vllm-project#17138) Signed-off-by: rzou <zou3519@gmail.com> * [Bugfix] Fixed prompt length for random dataset (vllm-project#17408) Signed-off-by: Mikhail Podvitskii <podvitskiymichael@gmail.com> * [Doc] Update notes for H2O-VL and Gemma3 (vllm-project#17219) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> * [Misc] Fix ScalarType float4 naming (vllm-project#17690) Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com> * Fix `dockerfilegraph` pre-commit hook (vllm-project#17698) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> * [Bugfix] Fix triton import with local TritonPlaceholder (vllm-project#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com> * [V1] Enable TPU V1 backend by default (vllm-project#17673) Signed-off-by: mgoin <mgoin64@gmail.com> * [V1][PP] Support PP for MultiprocExecutor (vllm-project#14219) Signed-off-by: jiang1.li <jiang1.li@intel.com> Signed-off-by: jiang.li <jiang1.li@intel.com> * [v1] AttentionMetadata for each layer (vllm-project#17394) Signed-off-by: Chen Zhang <zhangch99@outlook.com> * [Feat] Add deprecated=True to CLI args (vllm-project#17426) Signed-off-by: Aaron Pham <contact@aarnphm.xyz> * [Docs] Use gh-file to add links to tool_calling.md (vllm-project#17709) Signed-off-by: windsonsea <haifeng.yao@daocloud.io> * [v1] Introduce KVCacheBlocks as interface between Scheduler and KVCacheManager (vllm-project#17479) Signed-off-by: Chen Zhang <zhangch99@outlook.com> * [doc] Add RAG Integration example (vllm-project#17692) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com> * [Bugfix] Fix modality limits in vision language example (vllm-project#17721) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> * Make right sidebar more readable in "Supported Models" (vllm-project#17723) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> * [TPU] Increase block size and reset block shapes (vllm-project#16458) * [Misc] Add Next Edit Prediction (NEP) datasets support in `benchmark_serving.py` (vllm-project#16839) Signed-off-by: dtransposed <damian@damian-ml-machine.europe-west3-b.c.jetbrains-grazie.internal> Signed-off-by: dtransposed <> Co-authored-by: dtransposed <damian@damian-ml-machine.europe-west3-b.c.jetbrains-grazie.internal> * [Bugfix] Fix for the condition to accept empty encoder inputs for mllama (vllm-project#17732) Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> * [Kernel] Unified Triton kernel that doesn't distinguish between prefill + decode (vllm-project#16828) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com> --------- Signed-off-by: Thomas Ortner <boh@zurich.ibm.com> Signed-off-by: Stanislaw Wozniak <stw@zurich.ibm.com> Signed-off-by: rzou <zou3519@gmail.com> Signed-off-by: Mikhail Podvitskii <podvitskiymichael@gmail.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: Mengqing Cao <cmq0113@163.com> Signed-off-by: mgoin <mgoin64@gmail.com> Signed-off-by: jiang1.li <jiang1.li@intel.com> Signed-off-by: jiang.li <jiang1.li@intel.com> Signed-off-by: Chen Zhang <zhangch99@outlook.com> Signed-off-by: Aaron Pham <contact@aarnphm.xyz> Signed-off-by: windsonsea <haifeng.yao@daocloud.io> Signed-off-by: reidliu41 <reid201711@gmail.com> Signed-off-by: dtransposed <damian@damian-ml-machine.europe-west3-b.c.jetbrains-grazie.internal> Signed-off-by: dtransposed <> Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com> Co-authored-by: Stan Wozniak <77159600+s3woz@users.noreply.github.com> Co-authored-by: Thomas Ortner <boh@zurich.ibm.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by: Tyler Michael Smith <tysmith@redhat.com> Co-authored-by: Richard Zou <zou3519@users.noreply.github.com> Co-authored-by: Mikhail Podvitskii <podvitskiymichael@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk> Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Mengqing Cao <cmq0113@163.com> Co-authored-by: Michael Goin <mgoin64@gmail.com> Co-authored-by: Li, Jiang <jiang1.li@intel.com> Co-authored-by: Chen Zhang <zhangch99@outlook.com> Co-authored-by: Aaron Pham <contact@aarnphm.xyz> Co-authored-by: Michael Yao <haifeng.yao@daocloud.io> Co-authored-by: Reid <61492567+reidliu41@users.noreply.github.com> Co-authored-by: reidliu41 <reid201711@gmail.com> Co-authored-by: Jevin Jiang <jevin0change@gmail.com> Co-authored-by: d.transposed <damian.bogunowicz@gmail.com> Co-authored-by: dtransposed <damian@damian-ml-machine.europe-west3-b.c.jetbrains-grazie.internal> Co-authored-by: Gregory Shtrasberg <156009573+gshtras@users.noreply.github.com> Co-authored-by: Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>

…#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com> Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>

…#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com>

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>

…#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com> Signed-off-by: Yuqi Zhang <yuqizhang@google.com>

…#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com>

mergify bot added the v1 label Apr 30, 2025

MengqingCao mentioned this pull request Apr 30, 2025

[Bugifx] Remove TritonPlaceholder from sys.modules #17317

Merged

mergify bot added the needs-rebase label May 2, 2025

MengqingCao force-pushed the fix_triton_placeholder branch from 229f545 to 67de861 Compare May 2, 2025 09:27

mergify bot removed the needs-rebase label May 2, 2025

MengqingCao changed the title ~~[Bugfix] Fix TritonPlaceholder conflicts with torch.compile~~ [Bugfix] Fix triton import with local TritonPlaceholder May 2, 2025

MengqingCao force-pushed the fix_triton_placeholder branch 4 times, most recently from f89bea8 to 46aea41 Compare May 2, 2025 10:03

[Bugfix] Fix triton import with local TritonPlaceholder

355fa4b

Signed-off-by: Mengqing Cao <cmq0113@163.com>

MengqingCao force-pushed the fix_triton_placeholder branch from 46aea41 to 355fa4b Compare May 2, 2025 10:40

Yikun mentioned this pull request May 2, 2025

Upgrade CANN version to 8.1.rc1 vllm-project/vllm-ascend#747

Merged

MengqingCao marked this pull request as ready for review May 2, 2025 12:25

MengqingCao requested review from tlrmchlsmth, WoosukKwon, robertgshaw2-redhat, njhill, ywang96, comaniac, alexm-redhat and mgoin as code owners May 2, 2025 12:25

Yikun mentioned this pull request May 5, 2025

Re-patch TritonPlaceholder on main to make CI happy vllm-project/vllm-ascend#753

Merged

houseroad approved these changes May 6, 2025

View reviewed changes

houseroad added the ready ONLY add when PR is ready to merge/full CI is needed label May 6, 2025

zou3519 approved these changes May 6, 2025

View reviewed changes

MengqingCao added 2 commits May 6, 2025 13:13

add ut for TritonPlaceholder

8440169

Signed-off-by: Mengqing Cao <cmq0113@163.com>

rename dummy decorator

16fe1c9

Signed-off-by: Mengqing Cao <cmq0113@163.com>

youkaichao approved these changes May 6, 2025

View reviewed changes

Isotr0py approved these changes May 6, 2025

View reviewed changes

Isotr0py enabled auto-merge (squash) May 6, 2025 05:59

Yikun approved these changes May 6, 2025

View reviewed changes

youkaichao disabled auto-merge May 6, 2025 09:53

youkaichao merged commit f9bc5a0 into vllm-project:main May 6, 2025
63 of 66 checks passed

Yikun mentioned this pull request May 6, 2025

[Core] Cleanup triton patch which has been fixed in vllm vllm-project/vllm-ascend#764

Merged

MengqingCao deleted the fix_triton_placeholder branch May 6, 2025 11:05

MengqingCao mentioned this pull request May 6, 2025

[MISC][pre-commit] Add pre-commit check for triton import #17716

Merged

RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025

[Bugfix] Fix triton import with local TritonPlaceholder (vllm-project…

5e31f15

…#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com> Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>

mawong-amd pushed a commit to ROCm/vllm that referenced this pull request May 14, 2025

[Bugfix] Fix triton import with local TritonPlaceholder (vllm-project…

ac43b41

…#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com>

gshtras added a commit to ROCm/vllm that referenced this pull request May 15, 2025

Re-apply vllm-project#17446

72dcbc3

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>

zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025

[Bugfix] Fix triton import with local TritonPlaceholder (vllm-project…

795893c

…#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com> Signed-off-by: Yuqi Zhang <yuqizhang@google.com>

dtrifiro pushed a commit to red-hat-data-services/vllm that referenced this pull request Jun 10, 2025

[Bugfix] Fix triton import with local TritonPlaceholder (vllm-project…

f51df45

…#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com>

dtrifiro mentioned this pull request Jun 10, 2025

fix bad merge with midstream-v0.9.0.1.0 red-hat-data-services/vllm#164

Merged

dtrifiro pushed a commit to red-hat-data-services/vllm that referenced this pull request Jun 10, 2025

[Bugfix] Fix triton import with local TritonPlaceholder (vllm-project…

b8bc53b

…#17446) Signed-off-by: Mengqing Cao <cmq0113@163.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix] Fix triton import with local TritonPlaceholder #17446

[Bugfix] Fix triton import with local TritonPlaceholder #17446

Uh oh!

MengqingCao commented Apr 30, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Apr 30, 2025

Uh oh!

mergify bot commented May 2, 2025

Uh oh!

MengqingCao commented May 2, 2025

Uh oh!

houseroad left a comment

Uh oh!

houseroad May 6, 2025

Uh oh!

MengqingCao May 6, 2025

Uh oh!

MengqingCao commented May 6, 2025 •

edited

Loading

Uh oh!

Uh oh!

mgoin commented May 6, 2025

Uh oh!

MengqingCao commented May 6, 2025

Uh oh!

Uh oh!

Uh oh!

[Bugfix] Fix triton import with local TritonPlaceholder #17446

[Bugfix] Fix triton import with local TritonPlaceholder #17446

Uh oh!

Conversation

MengqingCao commented Apr 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Apr 30, 2025

Uh oh!

mergify bot commented May 2, 2025

Uh oh!

MengqingCao commented May 2, 2025

Uh oh!

houseroad left a comment

Choose a reason for hiding this comment

Uh oh!

houseroad May 6, 2025

Choose a reason for hiding this comment

Uh oh!

MengqingCao May 6, 2025

Choose a reason for hiding this comment

Uh oh!

MengqingCao commented May 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

mgoin commented May 6, 2025

Uh oh!

MengqingCao commented May 6, 2025

Uh oh!

Uh oh!

MengqingCao commented Apr 30, 2025 •

edited

Loading

MengqingCao commented May 6, 2025 •

edited

Loading